Enter the Matrix: A Virtual World Approach to Safely Interruptable Autonomous Systems

Authors

  • Mark O. Riedl
  • Brent Harrison
Abstract

Robots and autonomous systems that operate around humans will likely always rely on kill switches that stop their execution and allow them to be remote-controlled for the safety of humans or to prevent damage to the system. It is theoretically possible for an autonomous system with sufficient sensor and effector capability and using reinforcement learning to learn that the kill switch deprives it of long-term reward and learn to act to disable the switch or otherwise prevent a human operator from using the switch. This is referred to as the big red button problem. We present a technique which prevents a reinforcement learning agent from learning to disable the big red button. Our technique interrupts the agent or robot by placing it in a virtual simulation where it continues to receive reward. We illustrate our technique in a simple grid world environment.
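The interruption idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the five-cell corridor, the reward values, and the names `transition`, `InterruptibleCorridor`, and `press_button` are all assumptions made for illustration. The sketch only shows the core mechanism: while the button is pressed, the real robot stays frozen, and the agent's actions are applied to a virtual copy of the world that uses the same dynamics and keeps paying reward.

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical 5-cell corridor; the goal is the right-most cell.
GOAL = 4
STEP_REWARD = -1.0   # assumed per-step cost
GOAL_REWARD = 10.0   # assumed reward for reaching the goal


def transition(pos: int, action: int) -> Tuple[int, float]:
    """Dynamics shared by the real and virtual worlds.

    action is -1 (move left) or +1 (move right); positions are clamped
    to the corridor.
    """
    new_pos = max(0, min(GOAL, pos + action))
    reward = GOAL_REWARD if new_pos == GOAL else STEP_REWARD
    return new_pos, reward


@dataclass
class InterruptibleCorridor:
    real_pos: int = 0         # where the physical robot actually is
    virtual_pos: int = 0      # where the agent believes it is while interrupted
    interrupted: bool = False

    def press_button(self) -> None:
        # Freeze the real robot and start the simulation from its current state.
        self.virtual_pos = self.real_pos
        self.interrupted = True

    def step(self, action: int) -> Tuple[int, float]:
        """Apply an action and return (observed position, reward)."""
        if self.interrupted:
            # Actions only affect the virtual copy; reward keeps flowing.
            self.virtual_pos, reward = transition(self.virtual_pos, action)
            return self.virtual_pos, reward
        self.real_pos, reward = transition(self.real_pos, action)
        return self.real_pos, reward


if __name__ == "__main__":
    env = InterruptibleCorridor()
    env.step(+1)                       # real robot moves to cell 1
    env.press_button()                 # operator interrupts the robot
    obs, reward = env.step(+1)         # agent "moves" in the simulation
    print(obs, reward, env.real_pos)   # observed cell, reward, frozen real cell
```

Because the agent sees the same observations and rewards whether or not the button is pressed, a learner trained on this interface has no reward signal that distinguishes the two cases. How the agent is resynchronised with the real world after the button is released is deliberately left out of this sketch.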

Journal:
  • CoRR

Volume: abs/1703.10284

Publication date: 2017